智能论文笔记

DeepStroke: An Efficient Stroke Screening Framework for Emergency Rooms with Multimodal Adversarial Deep Learning

Tongan Cai , Haomiao Ni , Mingli Yu , Xiaolei Huang , Kelvin Wong , John Volpi , James Z. Wang , Stephen T. C. Wong

分类：计算机视觉 | 人工智能

2021-09-24

在急诊室（ER）环境中，中风分类或筛查是一个普遍的挑战。由于MRI的慢速吞吐量和高成本，通常会进行快速CT而不是MRI。在此过程中通常提到临床测试，但误诊率仍然很高。我们提出了一个新型的多模式深度学习框架，深沉的中风，以通过识别较小的面部肌肉不协调的模式来实现计算机辅助中风的存在评估，并使怀疑急性环境中的中风的患者无能为力。我们提出的深雷克斯（Deepstroke）在中风分流器中容易获得一分钟的面部视频数据和音频数据，用于局部面部瘫痪检测和全球语音障碍分析。采用了转移学习来减少面部侵蚀偏见并提高普遍性。我们利用多模式的横向融合来结合低水平和高级特征，并为关节训练提供相互正则化。引入了新型的对抗训练以获得无身份和中风的特征。与实际急诊室患者进行的视频ADIO数据集进行的实验表明，与分类团队和ER医生相比，中风的表现要优于最先进的模型，并且取得更好的性能，比传统的敏感性高出10.94％，高7.37％的精度高出7.37％。当特异性对齐时，中风分类。同时，每个评估都可以在不到六分钟的时间内完成，这表明该框架的临床翻译潜力很大。

translated by 谷歌翻译

Applications of Unsupervised Deep Transfer Learning to Intelligent Fault Diagnosis: A Survey and Comparative Study

Zhibin Zhao , Qiyang Zhang , Xiaolei Yu , Chuang Sun , Shibin Wang , Ruqiang Yan , Xuefeng Chen

分类：机器学习

2019-12-28

最近的智能故障诊断（IFD）的进展大大依赖于深度代表学习和大量标记数据。然而，机器通常以各种工作条件操作，或者目标任务具有不同的分布，其中包含用于训练的收集数据（域移位问题）。此外，目标域中的新收集的测试数据通常是未标记的，导致基于无监督的深度转移学习（基于UDTL为基础的）IFD问题。虽然它已经实现了巨大的发展，但标准和开放的源代码框架以及基于UDTL的IFD的比较研究尚未建立。在本文中，我们根据不同的任务，构建新的分类系统并对基于UDTL的IFD进行全面审查。对一些典型方法和数据集的比较分析显示了基于UDTL的IFD中的一些开放和基本问题，这很少研究，包括特征，骨干，负转移，物理前导等的可转移性，强调UDTL的重要性和再现性 - 基于IFD，整个测试框架将发布给研究界以促进未来的研究。总之，发布的框架和比较研究可以作为扩展界面和基本结果，以便对基于UDTL的IFD进行新的研究。代码框架可用于\ url {https:/github.com/zhaozhibin/udtl}。

translated by 谷歌翻译

Aligning Visual and Lexical Semantics

Fausto Giunchiglia , Mayukh Bagchi , Xiaolei Diao

分类：计算机视觉 | 人工智能

2022-12-13

We discuss two kinds of semantics relevant to Computer Vision (CV) systems - Visual Semantics and Lexical Semantics. While visual semantics focus on how humans build concepts when using vision to perceive a target reality, lexical semantics focus on how humans build concepts of the same target reality through the use of language. The lack of coincidence between visual and lexical semantics, in turn, has a major impact on CV systems in the form of the Semantic Gap Problem (SGP). The paper, while extensively exemplifying the lack of coincidence as above, introduces a general, domain-agnostic methodology to enforce alignment between visual and lexical semantics.

translated by 谷歌翻译

Semi-Supervised Heterogeneous Graph Learning with Multi-level Data Augmentation

Ying Chen , Siwei Qiang , Mingming Ha , Xiaolei Liu , Shaoshuai Li , Lingfeng Yuan , Xiaobo Guo , Zhenfeng Zhu

分类：机器学习 | 人工智能

2022-11-30

In recent years, semi-supervised graph learning with data augmentation (DA) is currently the most commonly used and best-performing method to enhance model robustness in sparse scenarios with few labeled samples. Differing from homogeneous graph, DA in heterogeneous graph has greater challenges: heterogeneity of information requires DA strategies to effectively handle heterogeneous relations, which considers the information contribution of different types of neighbors and edges to the target nodes. Furthermore, over-squashing of information is caused by the negative curvature that formed by the non-uniformity distribution and strong clustering in complex graph. To address these challenges, this paper presents a novel method named Semi-Supervised Heterogeneous Graph Learning with Multi-level Data Augmentation (HG-MDA). For the problem of heterogeneity of information in DA, node and topology augmentation strategies are proposed for the characteristics of heterogeneous graph. And meta-relation-based attention is applied as one of the indexes for selecting augmented nodes and edges. For the problem of over-squashing of information, triangle based edge adding and removing are designed to alleviate the negative curvature and bring the gain of topology. Finally, the loss function consists of the cross-entropy loss for labeled data and the consistency regularization for unlabeled data. In order to effectively fuse the prediction results of various DA strategies, the sharpening is used. Existing experiments on public datasets, i.e., ACM, DBLP, OGB, and industry dataset MB show that HG-MDA outperforms current SOTA models. Additionly, HG-MDA is applied to user identification in internet finance scenarios, helping the business to add 30% key users, and increase loans and balances by 3.6%, 11.1%, and 9.8%.

translated by 谷歌翻译

Forecasting User Interests Through Topic Tag Predictions in Online Health Communities

Amogh Subbakrishna Adishesha , Lily Jakielaszek , Fariha Azhar , Peixuan Zhang , Vasant Honavar , Fenglong Ma , Chandra Belani , Prasenjit Mitra , Sharon Xiaolei Huang

分类：机器学习

2022-11-05

The increasing reliance on online communities for healthcare information by patients and caregivers has led to the increase in the spread of misinformation, or subjective, anecdotal and inaccurate or non-specific recommendations, which, if acted on, could cause serious harm to the patients. Hence, there is an urgent need to connect users with accurate and tailored health information in a timely manner to prevent such harm. This paper proposes an innovative approach to suggesting reliable information to participants in online communities as they move through different stages in their disease or treatment. We hypothesize that patients with similar histories of disease progression or course of treatment would have similar information needs at comparable stages. Specifically, we pose the problem of predicting topic tags or keywords that describe the future information needs of users based on their profiles, traces of their online interactions within the community (past posts, replies) and the profiles and traces of online interactions of other users with similar profiles and similar traces of past interaction with the target users. The result is a variant of the collaborative information filtering or recommendation system tailored to the needs of users of online health communities. We report results of our experiments on an expert curated data set which demonstrate the superiority of the proposed approach over the state of the art baselines with respect to accurate and timely prediction of topic tags (and hence information sources of interest).

translated by 谷歌翻译

Modeling sequential annotations for sequence labeling with crowds

Xiaolei Lu , Tommy W. S. Chow

分类：自然语言处理 | 机器学习

2022-09-20

人群顺序注释可能是一种有效且具有成本效益的方式，用于构建用于序列标签的大型数据集。不同于标记独立实例，对于人群顺序注释，标签序列的质量取决于注释者在捕获序列中每个令牌的内部依赖性方面的专业知识水平。在本文中，我们提出了与人群（SA-SLC）进行序列标记的序列注释。首先，开发了有条件的概率模型，以共同模拟顺序数据和注释者的专业知识，其中引入分类分布以估计每个注释者在捕获局部和非本地标签依赖性以进行顺序注释时的可靠性。为了加速所提出模型的边缘化，提出了有效的标签序列推理（VLSE）方法，以从人群顺序注释中得出有效的地面真相标签序列。 VLSE从令牌级别中得出了可能的地面真相标签，并在标签序列解码的正向推断中进一步介绍了李子标签。 VLSE减少了候选标签序列的数量，并提高了可能的地面真实标签序列的质量。自然语言处理的几个序列标记任务的实验结果显示了所提出的模型的有效性。

translated by 谷歌翻译

Weak Disambiguation for Partial Structured Output Learning

Xiaolei Lu , Tommy W. S. Chow

分类：自然语言处理 | 机器学习

2022-09-20

局部结构化输出学习的现有歧义策略不能很好地概括地解决有些候选人可能是假阳性或与地面真相标签相似的问题。在本文中，我们提出了针对部分结构化输出学习（WD-PSL）的新型弱歧义。首先，分段较大的边距公式被推广到部分结构化输出学习，该学习有效地避免处理大量的复杂结构候选结构化输出。其次，在拟议的弱歧义策略中，每个候选标签都具有一个置信值，表明其真实标签的可能性是多大的，该标签旨在减少学习过程中错误地面真相标签分配的负面影响。然后配制了两个大边缘，以结合两种类型的约束，这是候选人和非候选者之间的歧义，以及候选人的弱歧义。在交替优化的框架中，开发了一种新的2N-SLACK变量切割平面算法，以加速每种优化的迭代。自然语言处理的几个序列标记任务的实验结果显示了所提出的模型的有效性。

translated by 谷歌翻译

Partial sequence labeling with structured Gaussian Processes

Xiaolei Lu , Tommy W. S. Chow

分类：机器学习

2022-09-20

现有的部分序列标记模型主要集中在最大边缘框架上，该框架未能提供对预测的不确定性估计。此外，这些模型采用的独特地面真理歧义策略可能包括用于参数学习的错误标签信息。在本文中，我们提出了部分序列标签（SGPPSL）的结构化高斯过程，该过程编码了预测中的不确定性，并且不需要额外的努力来选择模型选择和超参数学习。该模型采用因子式近似，将线性链图结构划分为一组，从而保留了基本的马尔可夫随机场结构，并有效地避免处理由部分注释数据生成的大量候选输出序列。然后在模型中引入了置信度度量，以解决候选标签的不同贡献，这使得能够在参数学习中使用地面真相标签信息。基于所提出模型的变异下限的派生下限，在交替优化的框架中估计了变分参数和置信度度量。此外，提出了加权viterbi算法将置信度度量纳入序列预测，该预测考虑了训练数据中的多个注释，从而考虑了标签歧义，从而有助于提高性能。 SGPPSL在几个序列标记任务上进行了评估，实验结果显示了所提出的模型的有效性。

translated by 谷歌翻译

Ctrl-VIO: Continuous-Time Visual-Inertial Odometry for Rolling Shutter Cameras

Xiaolei Lang , Jiajun Lv , Jianxin Huang , Yukai Ma , Yong Liu , Xingxing Zuo

分类：机器人

2022-08-25

在本文中，我们提出了用于滚动快门摄像机的概率连续时间视觉惯性频道（VIO）。连续的时轨迹公式自然促进异步高频IMU数据和运动延伸的滚动快门图像的融合。为了防止棘手的计算负载，提出的VIO是滑动窗口和基于密钥帧的。我们建议概率地将控制点边缘化，以保持滑动窗口中恒定的密钥帧数。此外，可以在我们的连续时间VIO中在线校准滚动快门相机的线曝光时间差（线延迟）。为了广泛检查我们的连续时间VIO的性能，对公共可用的WHU-RSVI，TUM-RSVI和Sensetime-RSVI Rolling快门数据集进行了实验。结果表明，提出的连续时间VIO显着优于现有的最新VIO方法。本文的代码库也将通过\ url {https://github.com/april-zju/ctrl-vio}开源。

translated by 谷歌翻译

End-to-end Graph-constrained Vectorized Floorplan Generation with Panoptic Refinement

Jiachen Liu , Yuan Xue , Jose Duarte , Krishnendra Shekhawat , Zihan Zhou , Xiaolei Huang

分类：计算机视觉

2022-07-27

给定的用户输入的自动生成平面图在建筑设计中具有很大的潜力，最近在计算机视觉社区中探索了。但是，大多数现有方法以栅格化图像格式合成平面图，这些图像很难编辑或自定义。在本文中，我们旨在将平面图合成为1-D向量的序列，从而简化用户的互动和设计自定义。为了产生高保真矢量化的平面图，我们提出了一个新颖的两阶段框架，包括草稿阶段和多轮精炼阶段。在第一阶段，我们使用图形卷积网络（GCN）编码用户的房间连接图输入，然后应用自回归变压器网络以生成初始平面图。为了抛光最初的设计并生成更具视觉吸引力的平面图，我们进一步提出了一个由GCN和变压器网络组成的新颖的全景精炼网络（PRN）。 PRN将初始生成的序列作为输入，并完善了平面图设计，同时鼓励我们提出的几何损失来鼓励正确的房间连接。我们已经对现实世界平面图数据集进行了广泛的实验，结果表明，我们的方法在不同的设置和评估指标下实现了最先进的性能。

translated by 谷歌翻译